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1 Introduction 

During the last few years various models of networks [1, 2] have become a 
powerful tool for analysis of complex systems in such distant fields as Internet 
[3], biology [4], social groups [5], ecology [6] and public transport [7]. Mod- 
eling behavior of economical agents is a challenging issue that has also been 
studied from a network point of view. The examples of such studies are mod- 
els of financial networks [8], supply chains [9, 10], production networks [11], 
investment networks [12] or collective bank bankrupcies [13, 14]. Relations 
between different companies have been already analyzed using several meth- 
ods: as networks of shareholders [15], networks of correlations between stock 
prices [16] or networks of board directors [17]. In several cases scaling laws for 
network characteristics have been observed. 

In the present study we consider relations between companies in Poland 
taking into account common branches they belong to. It is clear that com- 
panies belonging to the same branch compete for similar customers, so the 
market induces correlations between them. On the other hand two branches 
can be related by companies acting in both of them. To remove weak, acci- 
dental links we shall use a concept of threshold filtering for weighted networks 
where a link weight corresponds to a number of existing connections (common 
companies or branches) between a pair of nodes. 

2 Bipartite graph of companies and trades 

We have used the commercial database "Baza Kompass Polskie Firmy B2B" 
from September 2005. It contains information about over 50 000 large and 
medium size Polish companies belonging to one or more of 2150 different 
branches. We have constructed a bipartite graph of companies and trades in 
Poland as at Fig. 1. 
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Company 1 Company 2 Company 3 Company 4 




Fig. 1. Bipartite graph of companies and trades. 

In the bipartite graph we have two kinds of objects: branches A = 

1, 2, 3....Nb and companies i = 1,2,3 Nf, where Nb - total number of 

branches and Nf - total number of companies. Let us define a branch ca- 
pacity \Z(A)\ as the cardinality of set of companies belonging to the branch 
A. At Fig. 1 the branch A has the capacity \Z(A)\ = 2 while \Z(B)\ = 3 
and |Z(C)| = 1. The largest capacity of a branch in our database was 2486 
(construction executives), the second largest was 2334 (building materials). 

Let B(i) be a set of branches a given company i belongs to. We define a 
company diversity as |-B(i)|. An average company diversity fi is given as 



i=N 

1 



f i=i 



For our data set we have fi = 5.99. 

Similarly an average branch capacity v is given as 

A=N b 

» = jrH\ z ( A )\ ( 2 ) 

and we have v — 134. 

It is obvious that the following relation is fulfilled for our bipartite graph: 

— = JL. ( 3 ) 

N f N b y ' 



3 Companies and trades networks 

The bipartite graph from Fig. 1 has been transformed to create a companies 
network, where nodes are companies and a link means that two connected 
companies belong to at least one common branch. If we used the example 
from Fig.l we would obtain a companies network presented at Fig. 2. 

We have excluded from our dataset all items that correspond to com- 
munities (local administration) and for our analysis we consider Nf = 48158 
companies. All companies belong to a single cluster. Similarly a trade (branch) 
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Fig. 2. Companies network. 



network has been constructed where nodes are trades and an edge represents 
connection if at least one company belongs to both branches. In our database 
we have Nb = 2150 different branches. 




Fig. 3. Trades network. 



4 Weight, weight distribution and networks with cutoffs 

We have considered link-weighted networks. In the branches network the link 
weight means a number of companies that are active in the same pair of 
branches and it is formally a cardinality of a common part of sets Z(A) and 
Z(B), where Z{A) is a set of companies belonging to the branch A and Z{B) 
is a set of companies belonging to the branch B. 

w AB = \Z(A)nZ(B)\ (4) 

Let us define a function f£ which is equal to one if a company k belongs to 
the branch A, otherwise it is zero. 
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Using the function f£ the weight can be written as: 

JV F 

WAB = J2ttfk (6) 
fe=i 

The weight distribution p(w), meaning the probability p to find a link with a 
given weight w, is presented at Figure 4. The distribution is well approximated 
by a power function 

p(w) ~ (7) 




Fig. 4. Weight distribution in branches network. 



where the exponent 7 = 2.46 ± 0.07. One can notice the existence of edges 
with large weights. The maximum weight value is w max = 764, and the average 
weight 

(w) = wp(w) (8) 

equals (w) = 4.67. 

Using cutoffs for link weights we have constructed networks with different 
levels of filtering. In such networks nodes are connected only when their edge 
weight is no less than an assumed cutoff parameter w Q . 
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Table 1. Data for branches networks: w is the value of selected weight cutoff, N 
is the number of vertex with nonzero degrees, E is the number of links, k max is the 
maximum node degree, (k) is the average node degree, C is the clustering coefficient. 
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A weight in the companies network is defined in a similar way as in the 
branches networks, i.e. it is the number of common branches for two companies 
— formally it is equal to the cardinality of a common part of sets B(i) and 
B(j), where B(i) is a set of branches the company i belongs to, B(j) is a set 
of branches the company j belongs to. 

"ij = \B(i)r\B(j)\ (9) 
Using the function f£ the weight can be written as 



A=l 



(10) 
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The maximum value of observed weights w max = 207 is smaller in this 
networks than in the branches network while the average value equals (w) = 
1.48. The weight distribution is not a power law in this case and it shows an 
exponential behavior in a certain range. 

Similarly to the branches networks we have introduced cutoffs in companies 
network. At the Fig. 5 we present average degrees of nodes and maximum 
degrees as functions of the cutoff parameter w - We have observed a power 




law scaling 

{k)~ w -t (11) 

where for branches networks a b = 1.069 ± 0.008 and /3 b = 0.792 ± 0.005 
while for companies networks a/ = 2.13 ± 0.07 and (3f = 1.55 ± 0.04. 



5 Degree distribution 

We have analyzed the degree distribution for networks with different cutoff 
parameters. At Fig. 6 we present the degree distributions for companies net- 
works for different values of w . The distributions change qualitatively with 
increasing w a from a nonmonotonic function with an exponential tail (for 
w — 1) to a power law with exponent 7 (for w > 6). 

Values of exponent 7 for different cutoffs are given in the Table 3. 

Now let us come back to branches networks. At the Fig. 7 we present a 
degree distribution for w Q = 1. We observe a high diversity of node degrees 
— vertices with large values of k occur almost as frequent as vertices with a 
small k. 
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Table 2. Data for companies networks: w is the selected cutoff, N is the number 
of nodes with nonzero degrees, E is the number of links, k max is the maximum node 
degree, (k) is the average node degree, C is the clustering coefficient. 
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Table 3. Values of exponent 7 for different cutoffs w in companies networks. 
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For a properly chosen cutoff values the degree distributions are described 
by power laws. For w = 4 we see two regions of scaling with different expo- 
nents 71 and 72 while a transition point between both scaling regimes appears 
at k w 100. The transition appears due to the fact that there are almost no 
companies with diversity over 100, so branches with k > 100 have connections 
due to several companies, as opposed to branches with k < 100 that can be 
connected due to a single company. However the probability that many com- 
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Fig. 6. Degree distributions for companies networks for different values of w . X- 
marks are for w a = 1, circles are for w = 2, squares are for w — 3 and triangles 
are for w — 12. 

panies link a single branch with many different others is low, thus the degree 
probability p(k) decays much faster after the transition point. In the Table 4 
we present values 71 and 72 for different cutoffs w . 

It is important to stress that in both networks (companies and branches) 
the scaling behavior for degree distribution occurs only if we use cutoffs for 
links weights, compare Fig. 6 and Fig. 7. It follows that such cutoffs act as 
filters for the noise present in the complex network topology. 

6 Entropy of network topology 

Having a probability distribution of node degrees one can calculated a cor- 
responding measure of network heterogeneity. We have used the standard 
formula for Gibbs entropy, i.e. 

S = -£>(fc)lnp(fc) (13) 

k 

The entropy of degree distribution in branches networks decays logarith- 
mically as a function of the cutoff value (Fig. 8) 



S = — aln(w ) + b 



(14) 
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Fig. 7. Degree distribution in branches network for different values of w . Circles 
are for w = 1, crosses are for w — 4. 

where a = 0.834 ± 0.004 and b — 6.51 ± 0.02. The entropy in companies 
networks behaves similarly with a = 1.79 ± 0.05 and 6 = 8. 49 ±0.15. 

Table 4. Values of scaling exponents 71 and 72 for branches networks. 
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Fig. 9. Dependence of entropy on the average nodes degree. Circles represent 
branches networks and X-marks represent companies networks. 



The behavior has the following explanation. Diversity of node degrees is 
decreasing with growing weight cutoff values w . Larger cutoffs reduce total 
number of links in the network what leads to a smaller range of k and thus to 
smaller values of k max and (k). The relation between S and (k) is presented 
at the Fig. 9, where a logarithmic scaling can be seen 



S ~ a ln(fe) 



(15) 
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with a = 1.052 ± 0.003 for branches networks and a = 1.062 ± 0.019 for 
companies networks. 

7 Clustering coefficient 

We have analyzed a clustering coefficient dependence on node degree in 
branches and companies networks. 




In the companies network the clustering coefficient for small values of k is 
close to one, for larger k the value of C(k) exhibits logarithmic behavior 

C~/31n/c (16) 

with /?i = — 0.174 ± 0.006. In branches networks the logarithmic behavior is 
present for the whole range of k with @ 2 — —0.111 ± 0.004. 



8 Conclusions 

In this study, we have collected and analyzed data on companies in Poland. 
48158 medium/large firms and 2150 branches form a bipartite graph that 
allows to construct weighted networks of companies and branches. 
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Link weights in both networks are very heterogenous and a corresponding 
link weight distribution in the branches network follows a power law. Remov- 
ing links with weights smaller than a cutoff (threshold) w acts as a kind of 
filtering for network topology. This results in recovery of a hidden scaling rela- 
tions present in the network. The degree distribution for companies networks 
changes with increasing w from a nonmonotonic function with an exponen- 
tial tail (for w = 1) to a power law (for w > 6). For a filtered (w > 4) 
branches network we see two regions of scaling with different exponents and 
a transition point between both regimes. Entropies of degree distributions of 
both networks decay logarithmically as a function of cutoff parameter and are 
proportional to the logarithm of the mean node degree. 
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